RAW SEQUENCE LISTING DATE: 09/17/2004 

PATENT APPLICATION: US/10/824,131 TIME: 09:05:07 



ENTERED 



Input Set : N: \Crf 3\RULE60\10824131 , raw. txt 
Output Set: N:\CRF4\09172004\J824131.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Palmer, Leslie M. 

7 (ii) TITLE OF INVENTION: Novel ribG 

10 (iii) NUMBER OF SEQUENCES: 7 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: Dechert Price & Rhoads 

14 (B) STREET: 4000 Bell Atlantic Tower, 1717 Arch Stre 

15 (C) CITY: Philadelphia 

16 (D) STATE: PA f '^'"^''"^'"''''-^ 

17 (E) COUNTRY: US 

18 (F) ZIP: 19103 

20 (v) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Diskette 

22 (B) COMPUTER: IBM Compatible 

23 (C) OPERATING SYSTEM: DOS * -'^ -^^ '"'^^ 

24 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
26 (vi) CURRENT APPLICATION DATA: 

C--> 27 (A) APPLICATION NUMBER: US/10/824 ^ 131 

C--> 28 (B) FILING DATE: 14-Apr-2004 

29 (C) CLASSIFICATION: 

31 (vii) PRIOR APPLICATION DATA: 

W--> 32 (A) APPLICATION NUMBER: US/09/376 , 633 

33 (B) FILING DATE: 1999-August- 18 

W--> 34 (A) APPLICATION NUMBER: 08/978,456 

35 (B) FILING DATE: 

39 (viii) ATTORNEY/AGENT INFORMATION: 

40 (A) NAME: Dickinson, Todd Q 

41 (B) REGISTRATION NUMBER: 28,354 

42 (C) REFERENCE/DOCKET NUMBER: P50444-9 

44 (ix) TELECOMMUNICATION INFORMATION: 

45 (A) TELEPHONE: 215-994-2252 

46 (B) TELEFAX: 215-994-2222 

47 (C) TELEX: 

50 (2) INFORMATION FOR SEQ ID NO: 1: 

52 (i) SEQUENCE CHARACTERISTICS: 

53 (A) LENGTH: 102 9 base pairs 

54 (B) TYPE: nucleic acid 

55 (C) STRANDEDNESS : double 

56 (D) TOPOLOGY: linear 

59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

61 ATGGATTATG CGATTCAACT TGCAAATATG GTACAAGGTC AAACAGGTGT TAATCCACCC 

62 GTTGGCGCTG TTGTAGTTAA TGAAGGTAGG ATTGTTGGTA TTGGTGCACA CTTGAGAAAA 
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63 GGTGACAAGC ATGCGGAGGT TCAAGCACTT GATATGGCAC AACAAAATGC TGAAGGTGCG 180 

64 ACGATTTATA TTACGTTAGA GCCATGTAGT CATTTTGGTT CAACACCACC CTGTGTTAAC 240 

65 AAAATTATTG ATTGTAAGAT AGCAAAAGTA GTATACGCAA CAAAAGACAA TTCGTTAGAC 3 00 

66 ACACATGGTG ATGAGACGTT ACGGGCTCAC GGTATTGAGG TTGAATGCGT TGATGATGAA 360 

67 CGGGCATCAC AATTATACCA AGACTTTTTT AAAGCAAAAG CAAAGCAACT GCCACAAATT 420 

68 ACAGTGAAAG TATCTGCAAG TTTAGATGGT AAACAAGCGA ATGATAATGG ACAAAGTCAA 480 

69 TGGATTACTA ACAAAGAGGT TAAACAAGAT GTCTATAAGT TAAGACATCG ACACGACGCA 540 

70 GTGTTAACTG GAAGACGTAC AGTTGAATTA GATGATCCAC AATATACTAC ACGTATTCAA 600 

71 GATGGAAAAA ACCCTATAAA AGTAATATTG TCTAAGTCTG GGAATATTCA TTTTAATCAG 660 

72 CAAATTTATC AAGATGAATC AACACCAATT TGGATATATA CTGAAAATCC AAATTTAACA 720 

73 AGCAATCAAA CACATATTGA AATTATTTAC TTGAAGTCTT GTGATTTAAC AACAATTCTT 780 

74 CACAATTTAT ATAAAAGAGG AGTTGGAACT TTGCTAGTCG AGGCAGGTCC AACCACTACT 840 

75 TCAGAATTCT CCATCTATTA TATAGATGAA TTTATTCTCT ATTATGCCCC GAAATTAATT 900 

76 GGCGGATCTG GAAATTATCA ATTTTATCAA ACAAATGATG TGATTGAGAT ACCAGATGCG 960 

77 AACCAATTTG AAATTGTTCA TTCCGAGTTA TTAAATCAAA ATGTTAAATT AACTTTACGA 1020 

78 AAGAAGTGA 1029 
80 (2) INFORMATION FOR SEQ ID NO: 2: 

82 (i) SEQUENCE CHARACTERISTICS: 

83 (A) LENGTH: 342 amino acids 

84 (B) TYPE: amino acid 

85 (C) STRANDEDNESS : single 

86 (D) TOPOLOGY: linear 

89 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

91 Met Asp Tyr Ala lie Gin Leu Ala Asn Met Val Gin Gly Gin Thr Gly 

92 1 5 10 15 

93 Val Asn Pro Pro Val Gly Ala Val Val Val Asn Glu Gly Arg He Val 

94 20 25 30 

95 Gly He Gly Ala His Leu Arg Lys Gly Asp Lys His Ala Glu Val Gin 

96 35 40 45 

97 Ala Leu Asp Met Ala Gin Gin Asn Ala Glu Gly Ala Thr He Tyr He 

98 50 55 60 

99 Thr Leu Glu Pro Cys Ser His Phe Gly Ser Thr Pro Pro Cys Val Asn 

100 65 70 75 80 

101 Lys He He Asp Cys Lys He Ala Lys Val Val Tyr Ala Thr Lys Asp 

102 85 90 95 

103 Asn Ser Leu Asp Thr His Gly Asp Glu Thr Leu Arg Ala His Gly He 

104 100 105 110 

105 Glu Val Glu Cys Val Asp Asp Glu Arg Ala Ser Gin Leu Tyr Gin Asp 

106 115 120 125 

107 Phe Phe Lys Ala Lys Ala Lys Gin Leu Pro Gin He Thr Val Lys Val 

108 130 135 140 

109 Ser Ala Ser Leu Asp Gly Lys Gin Ala Asn Asp Asn Gly Gin Ser Gin 

110 145 150 155 160 

111 Trp He Thr Asn Lys Glu Val Lys Gin Asp Val Tyr Lys Leu Arg His 

112 165 170 175 

113 Arg His Asp Ala Val Leu Thr Gly Arg Arg Thr Val Glu Leu Asp Asp 

114 180 185 190 

115 Pro Gin Tyr Thr Thr Arg He Gin Asp Gly Lys Asn Pro He Lys Val 

116 195 200 205 
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117 He Leu Ser Lys Ser Gly Asn He His Phe Asn Gin Gin He Tyr Gin 

118 210 215 220 

119 Asp Glu Ser Thr Pro He Trp He Tyr Thr Glu Asn Pro Asn Leu Thr 

120 225 230 235 240 

121 Ser Asn Gin Thr His He Glu He He Tyr Leu Lys Ser Cys Asp Leu 

122 245 250 255 

123 Thr Thr He Leu His Asn Leu Tyr Lys Arg Gly Val Gly Thr Leu Leu 

124 260 265 ~ 270 

125 Val Glu Ala Gly Pro Thr Thr Thr Ser Glu Phe Ser He Tyr Tyr He 

126 275 280 285 

127 Asp Glu Phe He Leu Tyr Tyr Ala Pro Lys Leu He Gly Gly Ser Gly 

128 290 295 300 

12 9 Asn Tyr Gin Phe Tyr Gin Thr Asn Asp Val He Glu He Pro Asp Ala 

130 305 310 315 320 

131 Asn Gin Phe Glu He Val His Ser Glu Leu Leu Asn Gin Asn Val Lys 

132 325 330 335 

133 Leu Thr Leu Arg Lys Lys 

134 340 

136 (2) INFORMATION FOR SEQ ID NO: 3: 

138 (i) SEQUENCE CHARACTERISTICS: 

139 (A) LENGTH: 1269 base pairs 

140 (B) TYPE: nucleic acid 

141 (C) STRANDEDNESS : double 

142 (D) TOPOLOGY: linear 

145 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

147 AANCACCAAT CCNATTGGGA GGNAATCCAA ATCAATNCCC GGANNCCCAA TCCAAGTTAA 6 0 

14 8 TTAAGTCCAA GGTTTTGGAA CATTACCAAA TATGATTCCG ATGAGGTCAA ATGNCAANCG 12 0 

14 9 GTGTTAATAA ACTACGAAAT GNTGTGNAAA TGATAGTAGA NCAAGTTGCG CATACAGTNT 180 

15 0 CTCNATTATA TGATGCTTTA GAATCGAATG AGCAACAACA GCGCAGTTAC AATCAATAAT 240 

151 TTGTAACTAG AAGATAATAA AGAGAACGCT CTATAGAGAC GAATTGAAGG TTTGATTTTA 3 00 

152 ATGTCTGTTA GTAAGAATCA TATCAATGAG ATGCCTATAG TACTCAGATT ATATTAAATT 360 

153 AAAACCGTCA TTAATTGTTT TTTTAGAAAA CATATAGTAT CATTTTAAAT GTAGTTGACA 420 

154 TACTACGTAC TCAAATAATC TATAACT^TT TCATATATAA TTCTTTCGGG GCAGGGTGAA 480 

155 ATTCCCAACC GGCAGTAAAT AAAGCCTGCG ACCTGCTAAT ATGTTTCATA TTAGTGGCTG 540 

156 ATCTAGTGAG ATTCTAGAGC CGACAGTTAA AGTCTGGATG GGAGAAAGAA TGTTAATTAT 6 00 

157 CGACAAAGAT AATGTAGCGT ATTTGTAAAA ATGTGTACAA ATAGGCTTAT TTAACGATAA 660 
15 8 ATTTTTCTCC TTTGCATCTT AATTCATGAT GTGAGGATTT TTTGTTTATA GAGGTGATCA 720 

159 TTTGAGTCAA TTTATGGATT ATGCGATTCA ACTTCCAAAT ATGGTACAAG GTCANACAGG 780 

160 TGTTAATCCA CCCGTTGGCG CTGTTGTAGT TAATGAAGGT AGGATTGTTG GTATTGGTGC 840 

161 ACACTTGAGA AAAGGTGACA AGCATGCGGA GGTTCAAGCA CTTGATATGG CACAACANAA 900 

162 TGCTGAAGGT GCGACGATTT ATATTACGTT AGAGCCATGT AGTCATTTTG GTTCAACACC 960 

163 ACCCTGTGTT AACAAAATTA TTGATTGTAA GATAGCANAA GTAGTATTAC NCAACANAAG 1020 

164 ACAATTCCGT TAGACACACA TGGGTGATGA GACGTTACGG GGCTCCACGG TATTTGAGGG 1080 

165 TTGAATTGCG TTGGATGATG AACGGGCATC ACAATTATAC CAAAGACTTT TTTTAAAGCA 1140 

166 AAAAGCAAAG CAACTTGCCA CAAAATTACA GTGAAAGTNT CTTGAAAGTT TAGATGGGTA 1200 

167 AACAAAGCGA ATTGATAATG GACAAAGTCA ATGGATTACT AACAAAGAGG TTAAACAAGA 1260 

168 TGTCTATAG ;L269 
170 (2) INFORMATION FOR SEQ ID NO: 4: 

172 (i) SEQUENCE CHARACTERISTICS: 
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173 (A) LENGTH: 99 amino acids 

174 (B) TYPE: amino acid 

175 (C) STRANDEDNESS : single 

176 (D) TOPOLOGY: linear 

179 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

W--> 181 Met Asp Tyr Ala He Gin Leu Pro Asn Met Val Gin Gly Xaa Thr Gly 

182 15 10 15 

183 Val Asn Pro Pro Val Gly Ala Val Val Val Asn Glu Gly Arg He Val 

184 20 25 30 

185 Gly He Gly Ala His Leu Arg Lys Gly Asp Lys His Ala Glu Val Gin 

186 35 40 45 

187 Ala Leu Asp Met Ala Gin Xaa Asn Ala Glu Gly Ala Thr He Tyr He 

188 50 55 60 

189 Thr Leu Glu Pro Cys Ser His Phe Gly Ser Thr Pro Pro Cys Val Asn 

190 65 70 75 80 

191 Lys He He Asp Cys Lys He Ala Xaa Val Val Leu Xaa Asn Xaa Arg 

192 85 90 95 

193 Gin Phe Arg 

196 (2) INFORMATION FOR SEQ ID NO: 5: 

198 (i) SEQUENCE CHARACTERISTICS: 

199 (A) LENGTH: 21 base pairs 

200 (B) TYPE: nucleic acid 

2 01 (C) STRANDEDNESS: single 

202 (D) TOPOLOGY: linear 

205 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

207 ATGAAGGTAG GATTGTTGGT A 21 

209 (2) INFORMATION FOR SEQ ID NO: 6: 

211 (i) SEQUENCE CHARACTERISTICS: 

212 (A) LENGTH: 22 base pairs 

213 (B) TYPE: nucleic acid 

214 (C) STRANDEDNESS: single 

215 (D) TOPOLOGY: linear 

218 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

22 0 AGTCTTGGTA TAATTGTGAT GC 22 

222 (2) INFORMATION FOR SEQ ID NO: 7: 

224 (i) SEQUENCE CHARACTERISTICS: 

225 (A) LENGTH: 3336 base pairs 

226 (B) TYPE: nucleic acid 

227 (C) STRANDEDNESS: double 

22 8 (D) TOPOLOGY: linear 

231 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

233 ATGGATTATG CGATTCAACT TGCAAATATG GTACAAGGTC AAACAGGTGT TAATCCACCC 60 

2 34 GTTGGCGCTG TTGTAGTTAA TGAAGGTAGG ATTGTTGGTA TTGGTGCACA CTTGAGAAAA 12 0 

235 GGTGACAAGC ATGCGGAGGT TCAAGCACTT GATATGGCAC AACAAAATGC TGAAGGTGCG 180 

236 ACGATTTATA TTACGTTAGA GCCATGTAGT CATTTTGGTT CAACACCACC CTGTGTTAAC 240 

23 7 AAAATTATTG ATTGTAAGAT AGCAAAAGTA GTATACGCAA CAAAAGACAA TTCGTTAGAC 3 00 
238 ACACATGGTG ATGAGACGTT ACGGGCTCAC GGTATTGAGG TTGAATGCGT TGATGATGAA 360 
23 9 CGGGCATCAC AATTATACCA AGACTTTTTT AAAGCAAAAG CAAAGCAACT GCCACAAATT 420 
240 ACAGTGAAAG TATCTGCAAG TTTAGATGGT AAACAAGCGA ATGATAATGG ACAAAGTCAA 480 
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241 TGGATTACTA ACAAAGAGGT TAAACAAGAT GTCTATAAGT TAAGACATCG ACACGACGCA 

242 GTGTTAACTG GAAGACGTAC AGTTGAATTA GATGATCCAC AATATACTAC ACGTATTCAA 

243 GATGGAAAAA ACCCTATAAA AGTAATATTG TCTAAGTCTG GGAATATTCA TTTTAATCAG 

244 CAAATTTATC AAGATGAATC AACACCAATT TGGATATATA CTGAAAATCC AAATTTAACA 

245 AGCAATCAAA CACATATTGA AATTATTTAC TTGAAGTCTT GTGATTTAAC AACAATTCTT 

246 CACAATTTAT ATAAAAGAGG AGTTGGAACT TTGCTAGTCG AGGCAGGTCC AACCACTACT 

247 TCAGAATTCT CCATCTATTA TATAGATGAA TTTATTCTCT ATTATGCCCC GAAATTAATT 

248 GGCGGATCTG GAAATTATCA ATTTTATCAA ACAAATGATG TGATTGAGAT ACCAGATGCG 

249 AACCAATTTG AAATTGTTCA TTCCGAGTTA TTAAATCAAA ATGTTAAATT AACTTTACGA 

250 AAGAAGTGAT GATGCATGTT TACTGGCATC GTTGAAGAAA TAGGTGTCGT TAAAAGTGTT 

251 CAAATTCGTC AATCAGTCAG GACGATTGAA ATCGAAGCAC ATAAGATTAC GGCAGATATG 

252 CATATTGGTG ATTCCATCAG TGTTAATGGT GCATGTTTAA CAGTGATTGA TTTTGATCAG 

253 ACATCTTTTA CTGTTCAAGT AATTAAAAGC ACTGAAAATA AAACCTATTT AGCAGATGTT 

254 AAGCGACAAT CAGAAGTAAA TTTAGAGCGT GCCATGAGTG GTAACGGTAG GTTTGGTGGA 

255 CATTTTGTCC TCGGTCATGT AGATGAACTA GGAACAGTTT CAAAAATAAA TGAAACAGCC 

256 AATGCCAAAA TTATTACGAT TCAATGTAGC CAACATATTA ATAATCAGTT AGTTAAGCAA 

257 GGGTCTATTA CTGTGGATGG TGTAAGTCTA ACGGTATTTG ATAAGCATGA TAACAGTTTT 

258 GACATTCATC TTATTCCAGA AACGAGGCGT TCAACGATTT TATCATCCAA AAAATTAGGA 

259 GATAAAGTAC ATTTAGAAAC AGACGTTTTG TTTAAATATG TTGAAAATAT TTTAAATAAA 

260 GATAAAGACC AATTATCTGT AGATAAATTA AGAGCATTTG GTTTTTAGGA GGGGTAGCAT 

261 GCAATTCGAT AATATTGACA GTGCTTTAAT GGCTTTAAAA AATGGAGAAA CAATTATTGT 

262 AGTAGATGAT GAGAATCGTG AAAATGAAGG TGATTTAGTA GCGGTTACTG AATGGATGAA 

263 CGATAATACC ATTAATTTTA TGGCGAAAGA AGCAAGGGGA TTAATATGCG CACCAGTGTC 

264 TAAAGATATT GCACAACGTT TGGATTTGGT ACAAATGGTT GATGATAACT CCGACATCTT 
2 65 TGGTACGCAA TTTACAGTGA GTATTGATCA TGTAGATACA ACAACAGGAA TTAGTGCTTA 

266 TGAACGTACA TTGACTGCCA AAAAGCTCAT TGATCCTAGT AGTGAAGCTA AAGATTTTAA 

267 TCGTCCTGGT CATTTATTTC CATTAGTAGC ACAAGATAAA GGCGTATTAG CTAGAAATGG 

268 ACACACAGAA GCGGCTGTTG ATTTAGCTAA ACTTACTGGT GCCAAGCCCG CTGGTGTCAT 
26 9 TTGTGAGATT ATGAATGATG ACGGCACGAT GGCGAAAGGA CAAGATTTAC AAAATTTTAA 

270 AGAAAAACAT CAATTAAAGA TGATTACGAT TGATGATTTA ATTGAATATC GTAAAAAATT 

271 AGAACCAGAA ATTGAATTTA AGGCAAAAGT GAAAATGCCT ACAGATTTCG GAACATTTGA 

272 TATGTATGGT TTTAAAGCGA CATACACAGA TGAAGAGATA GTTGTACTGA CAAAAGGTGC 
2 73 AATTCGACAA CATGAAAATG TACGCTTACA TTCTGCGTGC CTTACAGGCG ATATTTTCCA 

274 TAGTCAACGT TGTGATTGTG GTGCTCAACT TGAATCGTCT ATGAAGTATA TCAATGAACA 

275 TGGTGGCATG ATTATTTATC TACCTCAAGA AGGTCGTGGC ATAGGATTGT TAAACAAATT 

276 ACGCGCATAT GAATTAATTG AGCAAGGATA TGATACAGTA ACTGCAAATT TAGCATTAGG 
2 77 TTTTGATGAA GATTTACGAG ATTATCATAT TGCTGCACAG ATTTTAAAAT ATTTTAACAT 
278 CGAACATATC AATTTATTAA GTAATAATCC AAGTAAATTT GAGGGATTAA AACAATATGG 
2 79 CATTGATATT GCAGAAAGAA TTGAAGTTAT CGTACCAGAA ACGGTACATA ATCATGATTA 
2 80 TATGGTAACG AAAAAAATAA AAATGGGTCA TTTAATATAG GAGGACTTTA ACATGAATTT 

281 TGAAGGTAAA TTAATTGGAA AAGATTTGAA AGTTGCAATC GTAGTTAGTC GATTTAATGA 

282 TTTTATCACT GGAAGATTAC TTGAAGGTGC AAAAGATACT TTGATTCGAC ATGATGTTAA 
2 83 TGAAGACAAT ATTGATGTAG CATTTGTTCC TGGTGCGTTT GAAATTCCTT TAGTAGCTAA 
2 84 AAAATTAGCC TCATCAGGAA ATTATGATGC AATAATTACA TTAGGATGCG TAATTCGCGG 
2 85 TGCTACGTCT CATTATGATT ATGTTTGTAA TGAAGTGCGA AAGGTGTTTC TAAAGTAAAT 
2 86 GATCAAACTA ATGTACCAGT CATATTTGGT ATTTTAACGA CTGAAAGTAT TGAACAAGCT 
287 GTGGAAAGAG CAGGTACGAA AGCTGGTAAT AAAGGTGCCG AAGCAGCAGT AAGTGCAATT 
2 88 GAAATGGCTA ATTTATTAAA ATCTATAAAA GCATAG 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/10/824,131 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing, Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:3; N Pes. 3,13,23,37,44,45,114,118,142,147,161,179,184,775,898,998 
Seq#:3; N Pes. 1011,1017,1179 
Seq#:4; Xaa Pos . 14 , 55 , 89 , 93 , 95 
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L:27 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
L:28 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:34 M:238 W: Alplia Fields not Ordered, Reordered [(A) APPLICATION NUMBER:] of (l) (vii) 
L:181 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos . : 0 
M:341 Repeated in SeqNo=4 
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